NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Iterative Linear Quadratic Optimization for Nonlinear Control: Differentiable Programming Algorithmic Templates

https://doi.org/10.5802/ojmo.32

Roulet, Vincent; Srinivasa, Siddhartha; Fazel, Maryam; Harchaoui, Zaid (November 2024, Open Journal of Mathematical Optimization)

Full Text Available
Distributionally Robust Optimization with Bias and Variance Reduction

Mehta, Ronak; Roulet, Vincent; Pillutla, Krishna; Harchaoui, Zaid (May 2024, OpenReview)
OpenReview (Ed.)
We consider the distributionally robust optimization (DRO) problem with spectral risk-based uncertainty set and f-divergence penalty. This formulation includes common risk-sensitive learning objectives such as regularized condition value-at-risk (CVaR) and average top-k loss. We present Prospect, a stochastic gradient-based algorithm that only requires tuning a single learning rate hyperparameter, and prove that it enjoys linear convergence for smooth regularized losses. This contrasts with previous algorithms that either require tuning multiple hyperparameters or potentially fail to converge due to biased gradient estimates or inadequate regularization. Empirically, we show that Prospect can converge 2-3× faster than baselines such as stochastic gradient and stochastic saddle-point methods on distribution shift and fairness benchmarks spanning tabular, vision, and language domains.
more » « less
Full Text Available
Stochastic Optimization for Spectral Risk Measures

Mehta, Ronak; Roulet, Vincent; Pillutla, Krishna; Liu, Lang; Harchaoui, Zaid (April 2023, Proceedings of The 26th International Conference on Artificial Intelligence and Statistics)
Ruiz, Francisco; Dy, Jennifer; an de Meent, Jan-Willem (Ed.)
Spectral risk objectives – also called L-risks – allow for learning systems to interpolate between optimizing average-case performance (as in empirical risk minimization) and worst-case performance on a task. We develop LSVRG, a stochastic algorithm to optimize these quantities by characterizing their subdifferential and addressing challenges such as biasedness of subgradient estimates and non-smoothness of the objective. We show theoretically and experimentally that out-of-the-box approaches such as stochastic subgradient and dual averaging can be hindered by bias, whereas our approach exhibits linear convergence.
more » « less
Full Text Available
On the Convergence of the Iterative Linear Exponential Quadratic Gaussian Algorithm to Stationary Points

https://doi.org/10.23919/ACC45564.2020.9147694

Roulet, Vincent; Fazel, Maryam; Srinivasa, Siddhartha; Harchaoui, Zaid (July 2020, 2020 American Control Conference)

A classical method for risk-sensitive nonlinear control is the iterative linear exponential quadratic Gaussian algorithm. We present its convergence analysis from a first-order optimization viewpoint. We identify the objective that the algorithm actually minimizes and we show how the addition of a proximal term guarantees convergence to a stationary point.
more » « less
Full Text Available
Iterative Linearized Control: Stable Algorithms and Complexity Guarantees

Roulet, Vincent; Srinivasa, Siddhartha; Drusvyatskiy, Dmitriy; Harchaoui, Zaid (June 2019, Proceedings of Machine Learning Research)

We examine popular gradient-based algorithms for nonlinear control in the light of the modern complexity analysis of first-order optimization algorithms. The examination reveals that the complexity bounds can be clearly stated in terms of calls to a computational oracle related to dynamic programming and implementable by gradient back-propagation using machine learning software libraries such as PyTorch or TensorFlow. Finally, we propose a regularized Gauss-Newton algorithm enjoying worst-case complexity bounds and improved convergence behavior in practice. The software library based on PyTorch is publicly available.
more » « less
Full Text Available
Iterative Linearized Control: Stable Algorithms and Complexity Guarantees

Roulet, Vincent; Srinivasa, Siddhartha; Drusvyatskiy, Dmitriy; Harchaoui, Zaid (January 2019, Proceedings of the 36th International Conference on Machine Learning)

We examine popular gradient-based algorithms for nonlinear control in the light of the modern complexity analysis of first-order optimization algorithms. The examination reveals that the complexity bounds can be clearly stated in terms of calls to a computational oracle related to dynamic programming and implementable by gradient back-propagation using machine learning software libraries such as PyTorch or TensorFlow. Finally, we propose a regularized Gauss-Newton algorithm enjoying worst-case complexity bounds and improved convergence behavior in practice. The software library based on PyTorch is publicly available.
more » « less
Full Text Available
A Smoother Way to Train Structured Prediction Models

Pillutla, Venkata K; Roulet, Vincent; Kakade, Sham M; Harchaoui, Zaid (January 2018, Advances in Neural Information Processing Systems 31)

We present a framework to train a structured prediction model by performing smoothing on the inference algorithm it builds upon. Smoothing overcomes the non-smoothness inherent to the maximum margin structured prediction objective, and paves the way for the use of fast primal gradient-based optimization algorithms. We illustrate the proposed framework by developing a novel primal incremental optimization algorithm for the structural support vector machine. The proposed algorithm blends an extrapolation scheme for acceleration and an adaptive smoothing scheme and builds upon the stochastic variance-reduced gradient algorithm. We establish its worst-case global complexity bound and study several practical variants. We present experimental results on two real-world problems, namely named entity recognition and visual object localization. The experimental results show that the proposed framework allows us to build upon efficient inference algorithms to develop large-scale optimization algorithms for structured prediction which can achieve competitive performance on the two real-world problems.
more » « less
Full Text Available

Search for: All records